Overview
Brought to you by YData
Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 515738 |
| Missing cells | 6536 |
| Missing cells (%) | 0.1% |
| Duplicate rows | 336 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 70.8 MiB |
| Average record size in memory | 144.0 B |
Variable types
| Text | 7 |
|---|---|
| Numeric | 9 |
| DateTime | 1 |
| Categorical | 1 |
| Dataset has 336 (0.1%) duplicate rows | Duplicates |
additional_number_of_scoring is highly overall correlated with total_number_of_reviews | High correlation |
reviewer_score is highly overall correlated with sample | High correlation |
sample is highly overall correlated with reviewer_score | High correlation |
total_number_of_reviews is highly overall correlated with additional_number_of_scoring | High correlation |
review_total_negative_word_counts has 127890 (24.8%) zeros | Zeros |
review_total_positive_word_counts has 35946 (7.0%) zeros | Zeros |
reviewer_score has 128935 (25.0%) zeros | Zeros |
Reproduction
| Analysis started | 2025-05-07 11:01:45.553309 |
|---|---|
| Analysis finished | 2025-05-07 11:02:23.688213 |
| Duration | 38.13 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
hotel_address
Text
| Distinct | 1493 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.9 MiB |
Length
| Max length | 96 |
|---|---|
| Median length | 78 |
| Mean length | 59.879357 |
| Min length | 34 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Via Senigallia 6 20161 Milan Italy |
|---|---|
| 2nd row | Arlandaweg 10 Westpoort 1043 EW Amsterdam Netherlands |
| 3rd row | Mallorca 251 Eixample 08008 Barcelona Spain |
| 4th row | Piazza Della Repubblica 17 Central Station 20124 Milan Italy |
| 5th row | Singel 303 309 Amsterdam City Center 1012 WJ Amsterdam Netherlands |
| Value | Count | Frequency (%) |
| london | 283657 | 5.7% |
| kingdom | 262692 | 5.2% |
| united | 262301 | 5.2% |
| westminster | 95105 | 1.9% |
| borough | 90619 | 1.8% |
| amsterdam | 82634 | 1.7% |
| city | 62594 | 1.3% |
| barcelona | 61231 | 1.2% |
| street | 60434 | 1.2% |
| spain | 60149 | 1.2% |
| Other values (2525) | 3684801 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4490884 | 14.5% | |
| n | 2381541 | 7.7% |
| e | 2224973 | 7.2% |
| a | 1876434 | 6.1% |
| o | 1633811 | 5.3% |
| t | 1600746 | 5.2% |
| r | 1481491 | 4.8% |
| i | 1459499 | 4.7% |
| d | 1358260 | 4.4% |
| s | 939735 | 3.0% |
| Other values (53) | 11434686 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 30882060 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 4490884 | 14.5% | |
| n | 2381541 | 7.7% |
| e | 2224973 | 7.2% |
| a | 1876434 | 6.1% |
| o | 1633811 | 5.3% |
| t | 1600746 | 5.2% |
| r | 1481491 | 4.8% |
| i | 1459499 | 4.7% |
| d | 1358260 | 4.4% |
| s | 939735 | 3.0% |
| Other values (53) | 11434686 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 30882060 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 4490884 | 14.5% | |
| n | 2381541 | 7.7% |
| e | 2224973 | 7.2% |
| a | 1876434 | 6.1% |
| o | 1633811 | 5.3% |
| t | 1600746 | 5.2% |
| r | 1481491 | 4.8% |
| i | 1459499 | 4.7% |
| d | 1358260 | 4.4% |
| s | 939735 | 3.0% |
| Other values (53) | 11434686 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 30882060 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 4490884 | 14.5% | |
| n | 2381541 | 7.7% |
| e | 2224973 | 7.2% |
| a | 1876434 | 6.1% |
| o | 1633811 | 5.3% |
| t | 1600746 | 5.2% |
| r | 1481491 | 4.8% |
| i | 1459499 | 4.7% |
| d | 1358260 | 4.4% |
| s | 939735 | 3.0% |
| Other values (53) | 11434686 |
additional_number_of_scoring
Real number (ℝ)
High correlation 
| Distinct | 480 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 498.08184 |
| Minimum | 1 |
|---|---|
| Maximum | 2682 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 57 |
| Q1 | 169 |
| median | 341 |
| Q3 | 660 |
| 95-th percentile | 1444 |
| Maximum | 2682 |
| Range | 2681 |
| Interquartile range (IQR) | 491 |
Descriptive statistics
| Standard deviation | 500.53847 |
|---|---|
| Coefficient of variation (CV) | 1.0049322 |
| Kurtosis | 5.751927 |
| Mean | 498.08184 |
| Median Absolute Deviation (MAD) | 201 |
| Skewness | 2.2077514 |
| Sum | 2.5687973 × 108 |
| Variance | 250538.76 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2682 | 4789 | 0.9% |
| 2288 | 4256 | 0.8% |
| 2623 | 4169 | 0.8% |
| 1831 | 3578 | 0.7% |
| 1936 | 3212 | 0.6% |
| 256 | 3079 | 0.6% |
| 1274 | 2958 | 0.6% |
| 832 | 2934 | 0.6% |
| 211 | 2858 | 0.6% |
| 404 | 2836 | 0.5% |
| Other values (470) | 481069 |
| Value | Count | Frequency (%) |
| 1 | 13 | < 0.1% |
| 4 | 12 | < 0.1% |
| 5 | 39 | < 0.1% |
| 6 | 118 | |
| 7 | 56 | < 0.1% |
| 8 | 57 | < 0.1% |
| 9 | 89 | |
| 10 | 195 | |
| 11 | 151 | |
| 12 | 67 | < 0.1% |
| Value | Count | Frequency (%) |
| 2682 | 4789 | |
| 2623 | 4169 | |
| 2288 | 4256 | |
| 1936 | 3212 | |
| 1831 | 3578 | |
| 1485 | 2628 | |
| 1471 | 2155 | |
| 1444 | 2565 | |
| 1427 | 2227 | |
| 1322 | 2223 |
review_date
Date
| Distinct | 731 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.9 MiB |
| Minimum | 2015-08-04 00:00:00 |
|---|---|
| Maximum | 2017-08-03 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
average_score
Real number (ℝ)
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.3974869 |
| Minimum | 5.2 |
|---|---|
| Maximum | 9.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | 5.2 |
|---|---|
| 5-th percentile | 7.4 |
| Q1 | 8.1 |
| median | 8.4 |
| Q3 | 8.8 |
| 95-th percentile | 9.2 |
| Maximum | 9.8 |
| Range | 4.6 |
| Interquartile range (IQR) | 0.7 |
Descriptive statistics
| Standard deviation | 0.54804817 |
|---|---|
| Coefficient of variation (CV) | 0.065263355 |
| Kurtosis | 0.4223857 |
| Mean | 8.3974869 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | -0.54524262 |
| Sum | 4330903.1 |
| Variance | 0.3003568 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8.4 | 41222 | 8.0% |
| 8.1 | 38122 | 7.4% |
| 8.5 | 38066 | 7.4% |
| 8.7 | 37798 | 7.3% |
| 8.6 | 36945 | 7.2% |
| 8.2 | 34847 | 6.8% |
| 8.3 | 32880 | 6.4% |
| 8.8 | 30836 | 6.0% |
| 8.9 | 28520 | 5.5% |
| 8 | 22342 | 4.3% |
| Other values (24) | 174160 |
| Value | Count | Frequency (%) |
| 5.2 | 65 | < 0.1% |
| 6.4 | 1163 | 0.2% |
| 6.6 | 400 | 0.1% |
| 6.7 | 965 | 0.2% |
| 6.8 | 1335 | 0.3% |
| 6.9 | 1737 | 0.3% |
| 7 | 3899 | |
| 7.1 | 6780 | |
| 7.2 | 684 | 0.1% |
| 7.3 | 3997 |
| Value | Count | Frequency (%) |
| 9.8 | 28 | < 0.1% |
| 9.6 | 915 | 0.2% |
| 9.5 | 1207 | 0.2% |
| 9.4 | 9339 | 1.8% |
| 9.3 | 12659 | |
| 9.2 | 12935 | |
| 9.1 | 21379 | |
| 9 | 21051 | |
| 8.9 | 28520 | |
| 8.8 | 30836 |
hotel_name
Text
| Distinct | 1492 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.9 MiB |
Length
| Max length | 60 |
|---|---|
| Median length | 43 |
| Mean length | 25.30696 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Hotel Da Vinci |
|---|---|
| 2nd row | Urban Lodge Hotel |
| 3rd row | Alexandra Barcelona A DoubleTree by Hilton |
| 4th row | Hotel Principe Di Savoia |
| 5th row | Hotel Esther a |
| Value | Count | Frequency (%) |
| hotel | 234986 | 11.6% |
| london | 137227 | 6.8% |
| the | 58689 | 2.9% |
| park | 43929 | 2.2% |
| amsterdam | 39868 | 2.0% |
| hilton | 35490 | 1.8% |
| by | 26928 | 1.3% |
| plaza | 23105 | 1.1% |
| paris | 21792 | 1.1% |
| grand | 18430 | 0.9% |
| Other values (1595) | 1386601 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1513162 | 11.6% | |
| e | 1229379 | 9.4% |
| o | 1085093 | 8.3% |
| n | 970893 | 7.4% |
| a | 904023 | 6.9% |
| t | 816440 | 6.3% |
| l | 768209 | 5.9% |
| r | 726795 | 5.6% |
| i | 570540 | 4.4% |
| s | 435214 | 3.3% |
| Other values (53) | 4032013 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 13051761 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1513162 | 11.6% | |
| e | 1229379 | 9.4% |
| o | 1085093 | 8.3% |
| n | 970893 | 7.4% |
| a | 904023 | 6.9% |
| t | 816440 | 6.3% |
| l | 768209 | 5.9% |
| r | 726795 | 5.6% |
| i | 570540 | 4.4% |
| s | 435214 | 3.3% |
| Other values (53) | 4032013 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 13051761 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1513162 | 11.6% | |
| e | 1229379 | 9.4% |
| o | 1085093 | 8.3% |
| n | 970893 | 7.4% |
| a | 904023 | 6.9% |
| t | 816440 | 6.3% |
| l | 768209 | 5.9% |
| r | 726795 | 5.6% |
| i | 570540 | 4.4% |
| s | 435214 | 3.3% |
| Other values (53) | 4032013 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 13051761 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1513162 | 11.6% | |
| e | 1229379 | 9.4% |
| o | 1085093 | 8.3% |
| n | 970893 | 7.4% |
| a | 904023 | 6.9% |
| t | 816440 | 6.3% |
| l | 768209 | 5.9% |
| r | 726795 | 5.6% |
| i | 570540 | 4.4% |
| s | 435214 | 3.3% |
| Other values (53) | 4032013 |
| Distinct | 227 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.9 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 34 |
| Mean length | 14.036272 |
| Min length | 1 |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | United Kingdom |
|---|---|
| 2nd row | Belgium |
| 3rd row | Sweden |
| 4th row | United States of America |
| 5th row | United Kingdom |
| Value | Count | Frequency (%) |
| united | 290992 | |
| kingdom | 245246 | |
| of | 35851 | 3.9% |
| states | 35511 | 3.9% |
| america | 35437 | 3.9% |
| australia | 21686 | 2.4% |
| ireland | 14827 | 1.6% |
| arab | 10235 | 1.1% |
| emirates | 10235 | 1.1% |
| saudi | 8951 | 1.0% |
| Other values (262) | 203907 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1428616 | ||
| i | 707670 | |
| n | 664608 | |
| d | 607285 | 8.4% |
| e | 500817 | 6.9% |
| t | 445571 | 6.2% |
| a | 370806 | 5.1% |
| o | 323726 | 4.5% |
| m | 315246 | 4.4% |
| U | 292269 | 4.0% |
| Other values (42) | 1582425 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7239039 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1428616 | ||
| i | 707670 | |
| n | 664608 | |
| d | 607285 | 8.4% |
| e | 500817 | 6.9% |
| t | 445571 | 6.2% |
| a | 370806 | 5.1% |
| o | 323726 | 4.5% |
| m | 315246 | 4.4% |
| U | 292269 | 4.0% |
| Other values (42) | 1582425 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7239039 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1428616 | ||
| i | 707670 | |
| n | 664608 | |
| d | 607285 | 8.4% |
| e | 500817 | 6.9% |
| t | 445571 | 6.2% |
| a | 370806 | 5.1% |
| o | 323726 | 4.5% |
| m | 315246 | 4.4% |
| U | 292269 | 4.0% |
| Other values (42) | 1582425 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7239039 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1428616 | ||
| i | 707670 | |
| n | 664608 | |
| d | 607285 | 8.4% |
| e | 500817 | 6.9% |
| t | 445571 | 6.2% |
| a | 370806 | 5.1% |
| o | 323726 | 4.5% |
| m | 315246 | 4.4% |
| U | 292269 | 4.0% |
| Other values (42) | 1582425 |
negative_review
Text
| Distinct | 330011 |
|---|---|
| Distinct (%) | 64.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.9 MiB |
Length
| Max length | 1966 |
|---|---|
| Median length | 1910 |
| Mean length | 93.798167 |
| Min length | 1 |
Unique
| Unique | 323546 ? |
|---|---|
| Unique (%) | 62.7% |
Sample
| 1st row | Would have appreciated a shop in the hotel that sold drinking water etc but not necessity Would recommend if like us you arrive late at night to bring drinks from plane airport as there s no shop nearby There is a minibar though if you want to pay those prices |
|---|---|
| 2nd row | No tissue paper box was present at the room |
| 3rd row | Pillows |
| 4th row | No Negative |
| 5th row | No Negative |
| Value | Count | Frequency (%) |
| the | 531268 | 5.8% |
| was | 236750 | 2.6% |
| a | 230251 | 2.5% |
| to | 228892 | 2.5% |
| and | 219473 | 2.4% |
| no | 197882 | 2.1% |
| room | 176026 | 1.9% |
| in | 168040 | 1.8% |
| negative | 129447 | 1.4% |
| not | 125701 | 1.4% |
| Other values (55627) | 6961291 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9301533 | ||
| e | 4710124 | 9.7% |
| t | 3568261 | 7.4% |
| o | 3563252 | 7.4% |
| a | 3077470 | 6.4% |
| i | 2442432 | 5.0% |
| n | 2399642 | 5.0% |
| r | 2321682 | 4.8% |
| s | 2085748 | 4.3% |
| h | 1783967 | 3.7% |
| Other values (53) | 13121168 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 48375279 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 9301533 | ||
| e | 4710124 | 9.7% |
| t | 3568261 | 7.4% |
| o | 3563252 | 7.4% |
| a | 3077470 | 6.4% |
| i | 2442432 | 5.0% |
| n | 2399642 | 5.0% |
| r | 2321682 | 4.8% |
| s | 2085748 | 4.3% |
| h | 1783967 | 3.7% |
| Other values (53) | 13121168 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 48375279 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 9301533 | ||
| e | 4710124 | 9.7% |
| t | 3568261 | 7.4% |
| o | 3563252 | 7.4% |
| a | 3077470 | 6.4% |
| i | 2442432 | 5.0% |
| n | 2399642 | 5.0% |
| r | 2321682 | 4.8% |
| s | 2085748 | 4.3% |
| h | 1783967 | 3.7% |
| Other values (53) | 13121168 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 48375279 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 9301533 | ||
| e | 4710124 | 9.7% |
| t | 3568261 | 7.4% |
| o | 3563252 | 7.4% |
| a | 3077470 | 6.4% |
| i | 2442432 | 5.0% |
| n | 2399642 | 5.0% |
| r | 2321682 | 4.8% |
| s | 2085748 | 4.3% |
| h | 1783967 | 3.7% |
| Other values (53) | 13121168 |
review_total_negative_word_counts
Real number (ℝ)
Zeros 
| Distinct | 402 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.53945 |
| Minimum | 0 |
|---|---|
| Maximum | 408 |
| Zeros | 127890 |
| Zeros (%) | 24.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 9 |
| Q3 | 23 |
| 95-th percentile | 69 |
| Maximum | 408 |
| Range | 408 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 29.690831 |
|---|---|
| Coefficient of variation (CV) | 1.6014947 |
| Kurtosis | 31.413626 |
| Mean | 18.53945 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 4.407949 |
| Sum | 9561499 |
| Variance | 881.54543 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 127890 | |
| 2 | 24647 | 4.8% |
| 3 | 18144 | 3.5% |
| 6 | 17749 | 3.4% |
| 5 | 16809 | 3.3% |
| 7 | 16140 | 3.1% |
| 4 | 15063 | 2.9% |
| 8 | 14716 | 2.9% |
| 9 | 13641 | 2.6% |
| 10 | 12422 | 2.4% |
| Other values (392) | 238517 |
| Value | Count | Frequency (%) |
| 0 | 127890 | |
| 2 | 24647 | 4.8% |
| 3 | 18144 | 3.5% |
| 4 | 15063 | 2.9% |
| 5 | 16809 | 3.3% |
| 6 | 17749 | 3.4% |
| 7 | 16140 | 3.1% |
| 8 | 14716 | 2.9% |
| 9 | 13641 | 2.6% |
| 10 | 12422 | 2.4% |
| Value | Count | Frequency (%) |
| 408 | 1 | |
| 403 | 2 | |
| 402 | 2 | |
| 401 | 1 | |
| 400 | 1 | |
| 399 | 2 | |
| 398 | 1 | |
| 397 | 1 | |
| 395 | 1 | |
| 393 | 2 |
total_number_of_reviews
Real number (ℝ)
High correlation 
| Distinct | 1142 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2743.7439 |
| Minimum | 43 |
|---|---|
| Maximum | 16670 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | 43 |
|---|---|
| 5-th percentile | 435 |
| Q1 | 1161 |
| median | 2134 |
| Q3 | 3613 |
| 95-th percentile | 7371 |
| Maximum | 16670 |
| Range | 16627 |
| Interquartile range (IQR) | 2452 |
Descriptive statistics
| Standard deviation | 2317.4649 |
|---|---|
| Coefficient of variation (CV) | 0.84463598 |
| Kurtosis | 6.4210843 |
| Mean | 2743.7439 |
| Median Absolute Deviation (MAD) | 1118 |
| Skewness | 2.0861703 |
| Sum | 1.415053 × 109 |
| Variance | 5370643.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9086 | 4789 | 0.9% |
| 9568 | 4256 | 0.8% |
| 12158 | 4169 | 0.8% |
| 7105 | 3578 | 0.7% |
| 7491 | 3212 | 0.6% |
| 6539 | 2958 | 0.6% |
| 5945 | 2768 | 0.5% |
| 6977 | 2628 | 0.5% |
| 5726 | 2565 | 0.5% |
| 4204 | 2551 | 0.5% |
| Other values (1132) | 482264 |
| Value | Count | Frequency (%) |
| 43 | 12 | < 0.1% |
| 45 | 12 | < 0.1% |
| 49 | 40 | |
| 51 | 13 | < 0.1% |
| 54 | 13 | < 0.1% |
| 59 | 75 | |
| 60 | 23 | < 0.1% |
| 61 | 17 | < 0.1% |
| 64 | 31 | |
| 66 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 16670 | 1877 | 0.4% |
| 12158 | 4169 | |
| 10842 | 1118 | 0.2% |
| 9568 | 4256 | |
| 9086 | 4789 | |
| 8177 | 1809 | 0.4% |
| 7656 | 1576 | 0.3% |
| 7586 | 1686 | 0.3% |
| 7491 | 3212 | |
| 7371 | 1335 | 0.3% |
positive_review
Text
| Distinct | 412601 |
|---|---|
| Distinct (%) | 80.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.9 MiB |
Length
| Max length | 1960 |
|---|---|
| Median length | 1841 |
| Mean length | 94.621069 |
| Min length | 1 |
Unique
| Unique | 403184 ? |
|---|---|
| Unique (%) | 78.2% |
Sample
| 1st row | Hotel was great clean friendly staff free breakfast every morning with good selection good wifi connection nice sized room with bath fridge in room Personally loved the fact that the hotel isn t in the city centre but is literally next to a train station that you can easily get to and from the airport city Would definitely stay again |
|---|---|
| 2nd row | No Positive |
| 3rd row | Nice welcoming and service |
| 4th row | Everything including the nice upgrade The Hotel has been revamped and what a surprise Love every second of it including in room dining which was excellent |
| 5th row | Lovely hotel v welcoming staff |
| Value | Count | Frequency (%) |
| the | 515247 | 6.1% |
| and | 420617 | 5.0% |
| was | 236743 | 2.8% |
| staff | 194574 | 2.3% |
| location | 192856 | 2.3% |
| very | 192743 | 2.3% |
| to | 187933 | 2.2% |
| a | 164977 | 1.9% |
| room | 140746 | 1.7% |
| hotel | 125326 | 1.5% |
| Other values (51225) | 6120421 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8724132 | ||
| e | 4823582 | 9.9% |
| a | 3497026 | 7.2% |
| o | 3418374 | 7.0% |
| t | 3414477 | 7.0% |
| n | 2492661 | 5.1% |
| r | 2427626 | 5.0% |
| i | 2342431 | 4.8% |
| s | 2116857 | 4.3% |
| l | 2084944 | 4.3% |
| Other values (53) | 13457571 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 48799681 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 8724132 | ||
| e | 4823582 | 9.9% |
| a | 3497026 | 7.2% |
| o | 3418374 | 7.0% |
| t | 3414477 | 7.0% |
| n | 2492661 | 5.1% |
| r | 2427626 | 5.0% |
| i | 2342431 | 4.8% |
| s | 2116857 | 4.3% |
| l | 2084944 | 4.3% |
| Other values (53) | 13457571 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 48799681 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 8724132 | ||
| e | 4823582 | 9.9% |
| a | 3497026 | 7.2% |
| o | 3418374 | 7.0% |
| t | 3414477 | 7.0% |
| n | 2492661 | 5.1% |
| r | 2427626 | 5.0% |
| i | 2342431 | 4.8% |
| s | 2116857 | 4.3% |
| l | 2084944 | 4.3% |
| Other values (53) | 13457571 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 48799681 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 8724132 | ||
| e | 4823582 | 9.9% |
| a | 3497026 | 7.2% |
| o | 3418374 | 7.0% |
| t | 3414477 | 7.0% |
| n | 2492661 | 5.1% |
| r | 2427626 | 5.0% |
| i | 2342431 | 4.8% |
| s | 2116857 | 4.3% |
| l | 2084944 | 4.3% |
| Other values (53) | 13457571 |
review_total_positive_word_counts
Real number (ℝ)
Zeros 
| Distinct | 365 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.776458 |
| Minimum | 0 |
|---|---|
| Maximum | 395 |
| Zeros | 35946 |
| Zeros (%) | 7.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 11 |
| Q3 | 22 |
| 95-th percentile | 56 |
| Maximum | 395 |
| Range | 395 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 21.804185 |
|---|---|
| Coefficient of variation (CV) | 1.2265765 |
| Kurtosis | 32.943045 |
| Mean | 17.776458 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 4.1911321 |
| Sum | 9167995 |
| Variance | 475.42249 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 35946 | 7.0% |
| 6 | 26921 | 5.2% |
| 5 | 26844 | 5.2% |
| 4 | 24656 | 4.8% |
| 7 | 24538 | 4.8% |
| 8 | 23238 | 4.5% |
| 3 | 22533 | 4.4% |
| 9 | 21208 | 4.1% |
| 2 | 20934 | 4.1% |
| 10 | 19611 | 3.8% |
| Other values (355) | 269309 |
| Value | Count | Frequency (%) |
| 0 | 35946 | |
| 2 | 20934 | |
| 3 | 22533 | |
| 4 | 24656 | |
| 5 | 26844 | |
| 6 | 26921 | |
| 7 | 24538 | |
| 8 | 23238 | |
| 9 | 21208 | |
| 10 | 19611 |
| Value | Count | Frequency (%) |
| 395 | 1 | |
| 386 | 1 | |
| 384 | 2 | |
| 383 | 2 | |
| 382 | 1 | |
| 380 | 1 | |
| 378 | 1 | |
| 377 | 1 | |
| 375 | 2 | |
| 374 | 1 |
total_number_of_reviews_reviewer_has_given
Real number (ℝ)
| Distinct | 198 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.166001 |
| Minimum | 1 |
|---|---|
| Maximum | 355 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 8 |
| 95-th percentile | 26 |
| Maximum | 355 |
| Range | 354 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 11.040228 |
|---|---|
| Coefficient of variation (CV) | 1.54064 |
| Kurtosis | 51.479447 |
| Mean | 7.166001 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 5.0875667 |
| Sum | 3695779 |
| Variance | 121.88663 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 154640 | |
| 2 | 67077 | |
| 3 | 46845 | 9.1% |
| 4 | 35026 | 6.8% |
| 5 | 27629 | 5.4% |
| 6 | 22621 | 4.4% |
| 7 | 18614 | 3.6% |
| 8 | 16150 | 3.1% |
| 9 | 13545 | 2.6% |
| 10 | 11717 | 2.3% |
| Other values (188) | 101874 |
| Value | Count | Frequency (%) |
| 1 | 154640 | |
| 2 | 67077 | |
| 3 | 46845 | 9.1% |
| 4 | 35026 | 6.8% |
| 5 | 27629 | 5.4% |
| 6 | 22621 | 4.4% |
| 7 | 18614 | 3.6% |
| 8 | 16150 | 3.1% |
| 9 | 13545 | 2.6% |
| 10 | 11717 | 2.3% |
| Value | Count | Frequency (%) |
| 355 | 1 | < 0.1% |
| 330 | 1 | < 0.1% |
| 315 | 4 | |
| 297 | 2 | |
| 281 | 2 | |
| 270 | 2 | |
| 250 | 3 | |
| 239 | 1 | < 0.1% |
| 237 | 1 | < 0.1% |
| 232 | 1 | < 0.1% |
tags
Text
| Distinct | 55242 |
|---|---|
| Distinct (%) | 10.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.9 MiB |
Length
| Max length | 213 |
|---|---|
| Median length | 178 |
| Mean length | 102.41794 |
| Min length | 11 |
Unique
| Unique | 29892 ? |
|---|---|
| Unique (%) | 5.8% |
Sample
| 1st row | [' Leisure trip ', ' Couple ', ' Double Room ', ' Stayed 2 nights '] |
|---|---|
| 2nd row | [' Leisure trip ', ' Group ', ' Triple Room ', ' Stayed 1 night '] |
| 3rd row | [' Business trip ', ' Solo traveler ', ' Twin Room ', ' Stayed 1 night ', ' Submitted from a mobile device '] |
| 4th row | [' Leisure trip ', ' Couple ', ' Ambassador Junior Suite ', ' Stayed 1 night '] |
| 5th row | [' Business trip ', ' Solo traveler ', ' Classic Double or Twin Room ', ' Stayed 2 nights ', ' Submitted from a mobile device '] |
| Value | Count | Frequency (%) |
| 4713184 | ||
| stayed | 515546 | 4.5% |
| trip | 500717 | 4.4% |
| room | 467443 | 4.1% |
| leisure | 417900 | 3.6% |
| nights | 321909 | 2.8% |
| a | 309360 | 2.7% |
| from | 307963 | 2.7% |
| mobile | 307693 | 2.7% |
| device | 307640 | 2.7% |
| Other values (644) | 3306475 |
Most occurring characters
| Value | Count | Frequency (%) |
| 10960092 | ||
| ' | 4713184 | 8.9% |
| e | 4064833 | 7.7% |
| i | 3223273 | 6.1% |
| o | 2787265 | 5.3% |
| t | 2606633 | 4.9% |
| r | 2080930 | 3.9% |
| , | 1840854 | 3.5% |
| u | 1792540 | 3.4% |
| m | 1539332 | 2.9% |
| Other values (57) | 17211886 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 52820822 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 10960092 | ||
| ' | 4713184 | 8.9% |
| e | 4064833 | 7.7% |
| i | 3223273 | 6.1% |
| o | 2787265 | 5.3% |
| t | 2606633 | 4.9% |
| r | 2080930 | 3.9% |
| , | 1840854 | 3.5% |
| u | 1792540 | 3.4% |
| m | 1539332 | 2.9% |
| Other values (57) | 17211886 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 52820822 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 10960092 | ||
| ' | 4713184 | 8.9% |
| e | 4064833 | 7.7% |
| i | 3223273 | 6.1% |
| o | 2787265 | 5.3% |
| t | 2606633 | 4.9% |
| r | 2080930 | 3.9% |
| , | 1840854 | 3.5% |
| u | 1792540 | 3.4% |
| m | 1539332 | 2.9% |
| Other values (57) | 17211886 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 52820822 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 10960092 | ||
| ' | 4713184 | 8.9% |
| e | 4064833 | 7.7% |
| i | 3223273 | 6.1% |
| o | 2787265 | 5.3% |
| t | 2606633 | 4.9% |
| r | 2080930 | 3.9% |
| , | 1840854 | 3.5% |
| u | 1792540 | 3.4% |
| m | 1539332 | 2.9% |
| Other values (57) | 17211886 |
| Distinct | 731 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.9 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.9839589 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 13 days |
|---|---|
| 2nd row | 234 day |
| 3rd row | 616 day |
| 4th row | 656 day |
| 5th row | 444 day |
| Value | Count | Frequency (%) |
| day | 439997 | |
| days | 75741 | 7.3% |
| 1 | 2585 | 0.3% |
| 322 | 2308 | 0.2% |
| 120 | 2284 | 0.2% |
| 338 | 1963 | 0.2% |
| 534 | 1940 | 0.2% |
| 394 | 1904 | 0.2% |
| 429 | 1860 | 0.2% |
| 241 | 1803 | 0.2% |
| Other values (723) | 499091 |
Most occurring characters
| Value | Count | Frequency (%) |
| 515738 | ||
| d | 515738 | |
| a | 515738 | |
| y | 515738 | |
| 3 | 185784 | 5.2% |
| 1 | 177517 | 4.9% |
| 2 | 177377 | 4.9% |
| 4 | 170996 | 4.7% |
| 6 | 163556 | 4.5% |
| 5 | 159080 | 4.4% |
| Other values (5) | 504631 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3601893 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 515738 | ||
| d | 515738 | |
| a | 515738 | |
| y | 515738 | |
| 3 | 185784 | 5.2% |
| 1 | 177517 | 4.9% |
| 2 | 177377 | 4.9% |
| 4 | 170996 | 4.7% |
| 6 | 163556 | 4.5% |
| 5 | 159080 | 4.4% |
| Other values (5) | 504631 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3601893 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 515738 | ||
| d | 515738 | |
| a | 515738 | |
| y | 515738 | |
| 3 | 185784 | 5.2% |
| 1 | 177517 | 4.9% |
| 2 | 177377 | 4.9% |
| 4 | 170996 | 4.7% |
| 6 | 163556 | 4.5% |
| 5 | 159080 | 4.4% |
| Other values (5) | 504631 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3601893 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 515738 | ||
| d | 515738 | |
| a | 515738 | |
| y | 515738 | |
| 3 | 185784 | 5.2% |
| 1 | 177517 | 4.9% |
| 2 | 177377 | 4.9% |
| 4 | 170996 | 4.7% |
| 6 | 163556 | 4.5% |
| 5 | 159080 | 4.4% |
| Other values (5) | 504631 |
lat
Real number (ℝ)
| Distinct | 1472 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 3268 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49.442439 |
| Minimum | 41.328376 |
|---|---|
| Maximum | 52.400181 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | 41.328376 |
|---|---|
| 5-th percentile | 41.386146 |
| Q1 | 48.214662 |
| median | 51.499981 |
| Q3 | 51.516288 |
| 95-th percentile | 52.36813 |
| Maximum | 52.400181 |
| Range | 11.071805 |
| Interquartile range (IQR) | 3.301626 |
Descriptive statistics
| Standard deviation | 3.4663252 |
|---|---|
| Coefficient of variation (CV) | 0.070108298 |
| Kurtosis | 0.65444514 |
| Mean | 49.442439 |
| Median Absolute Deviation (MAD) | 0.0591145 |
| Skewness | -1.4036504 |
| Sum | 25337767 |
| Variance | 12.015411 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 51.5019097 | 4789 | 0.9% |
| 51.5110993 | 4256 | 0.8% |
| 51.5009609 | 4169 | 0.8% |
| 51.499046 | 3578 | 0.7% |
| 51.5108412 | 3212 | 0.6% |
| 51.5109945 | 2958 | 0.6% |
| 51.499981 | 2768 | 0.5% |
| 51.5195688 | 2628 | 0.5% |
| 51.4935083 | 2565 | 0.5% |
| 51.5024348 | 2551 | 0.5% |
| Other values (1462) | 478996 | |
| (Missing) | 3268 | 0.6% |
| Value | Count | Frequency (%) |
| 41.3283758 | 572 | |
| 41.368437 | 575 | |
| 41.3703041 | 229 | < 0.1% |
| 41.371308 | 1082 | |
| 41.3725246 | 120 | < 0.1% |
| 41.3727844 | 265 | 0.1% |
| 41.3732462 | 797 | |
| 41.3747031 | 179 | < 0.1% |
| 41.3747873 | 158 | < 0.1% |
| 41.3750293 | 932 |
| Value | Count | Frequency (%) |
| 52.4001813 | 312 | 0.1% |
| 52.3924898 | 467 | 0.1% |
| 52.3923684 | 143 | < 0.1% |
| 52.3872884 | 856 | |
| 52.3856494 | 1071 | |
| 52.385601 | 1686 | |
| 52.3846059 | 916 | |
| 52.3840358 | 108 | < 0.1% |
| 52.3793659 | 845 | |
| 52.3786823 | 594 | 0.1% |
lng
Real number (ℝ)
| Distinct | 1472 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 3268 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.823803 |
| Minimum | -0.3697581 |
|---|---|
| Maximum | 16.429233 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 256226 |
| Negative (%) | 49.7% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | -0.3697581 |
|---|---|
| 5-th percentile | -0.1947475 |
| Q1 | -0.143372 |
| median | 0.010607 |
| Q3 | 4.834443 |
| 95-th percentile | 16.356445 |
| Maximum | 16.429233 |
| Range | 16.798991 |
| Interquartile range (IQR) | 4.977815 |
Descriptive statistics
| Standard deviation | 4.5794253 |
|---|---|
| Coefficient of variation (CV) | 1.6217227 |
| Kurtosis | 2.7791456 |
| Mean | 2.823803 |
| Median Absolute Deviation (MAD) | 0.2941333 |
| Skewness | 1.8964221 |
| Sum | 1447114.3 |
| Variance | 20.971136 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -0.0232208 | 4789 | 0.9% |
| -0.1208673 | 4256 | 0.8% |
| -0.1165913 | 4169 | 0.8% |
| -0.1917073 | 3578 | 0.7% |
| -0.0780581 | 3212 | 0.6% |
| -0.1863417 | 2958 | 0.6% |
| -0.1928791 | 2768 | 0.5% |
| -0.170521 | 2628 | 0.5% |
| -0.1834346 | 2565 | 0.5% |
| -0.0002497 | 2551 | 0.5% |
| Other values (1462) | 478996 | |
| (Missing) | 3268 | 0.6% |
| Value | Count | Frequency (%) |
| -0.3697581 | 413 | 0.1% |
| -0.3192925 | 391 | 0.1% |
| -0.306071 | 128 | < 0.1% |
| -0.2915052 | 385 | 0.1% |
| -0.290706 | 680 | 0.1% |
| -0.2864945 | 1212 | |
| -0.284704 | 1848 | |
| -0.2835263 | 2227 | |
| -0.282992 | 197 | < 0.1% |
| -0.2787261 | 1251 |
| Value | Count | Frequency (%) |
| 16.4292329 | 41 | < 0.1% |
| 16.4219737 | 224 | |
| 16.4217627 | 426 | |
| 16.4210093 | 361 | |
| 16.4200957 | 431 | |
| 16.417026 | 143 | < 0.1% |
| 16.4133973 | 191 | < 0.1% |
| 16.4129493 | 501 | |
| 16.4116997 | 92 | < 0.1% |
| 16.4082294 | 63 | < 0.1% |
sample
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.9 MiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 386803 | |
| 0 | 128935 | 25.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 386803 | |
| 0 | 128935 | 25.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 386803 | |
| 0 | 128935 | 25.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 515738 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 386803 | |
| 0 | 128935 | 25.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 515738 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 386803 | |
| 0 | 128935 | 25.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 515738 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 386803 | |
| 0 | 128935 | 25.0% |
reviewer_score
Real number (ℝ)
High correlation  Zeros 
| Distinct | 38 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.2976717 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 128935 |
| Zeros (%) | 25.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.625 |
| median | 7.9 |
| Q3 | 9.6 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 8.975 |
Descriptive statistics
| Standard deviation | 3.902295 |
|---|---|
| Coefficient of variation (CV) | 0.61964091 |
| Kurtosis | -1.0623936 |
| Mean | 6.2976717 |
| Median Absolute Deviation (MAD) | 1.7 |
| Skewness | -0.78737492 |
| Sum | 3247948.6 |
| Variance | 15.227906 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 128935 | |
| 10 | 86803 | |
| 9.6 | 53502 | |
| 9.2 | 44053 | 8.5% |
| 8.8 | 34795 | 6.7% |
| 8.3 | 30903 | 6.0% |
| 7.5 | 26164 | 5.1% |
| 7.9 | 24901 | 4.8% |
| 7.1 | 18529 | 3.6% |
| 6.7 | 14117 | 2.7% |
| Other values (28) | 53036 |
| Value | Count | Frequency (%) |
| 0 | 128935 | |
| 2.5 | 1632 | 0.3% |
| 2.9 | 1211 | 0.2% |
| 3 | 25 | < 0.1% |
| 3.1 | 6 | < 0.1% |
| 3.3 | 2063 | 0.4% |
| 3.5 | 61 | < 0.1% |
| 3.8 | 3017 | 0.6% |
| 4 | 66 | < 0.1% |
| 4.2 | 3827 | 0.7% |
| Value | Count | Frequency (%) |
| 10 | 86803 | |
| 9.6 | 53502 | |
| 9.5 | 523 | 0.1% |
| 9.4 | 47 | < 0.1% |
| 9.2 | 44053 | |
| 9 | 483 | 0.1% |
| 8.8 | 34795 | |
| 8.5 | 379 | 0.1% |
| 8.3 | 30903 | 6.0% |
| 8.1 | 28 | < 0.1% |
Interactions
Correlations
| additional_number_of_scoring | average_score | lat | lng | review_total_negative_word_counts | review_total_positive_word_counts | reviewer_score | sample | total_number_of_reviews | total_number_of_reviews_reviewer_has_given | |
|---|---|---|---|---|---|---|---|---|---|---|
| additional_number_of_scoring | 1.000 | -0.128 | 0.426 | -0.385 | 0.049 | -0.057 | -0.027 | 0.000 | 0.859 | -0.105 |
| average_score | -0.128 | 1.000 | -0.086 | 0.180 | -0.159 | 0.139 | 0.200 | 0.000 | -0.194 | 0.041 |
| lat | 0.426 | -0.086 | 1.000 | -0.324 | 0.036 | -0.025 | -0.015 | 0.000 | 0.151 | -0.101 |
| lng | -0.385 | 0.180 | -0.324 | 1.000 | -0.050 | 0.060 | 0.035 | 0.000 | -0.044 | 0.117 |
| review_total_negative_word_counts | 0.049 | -0.159 | 0.036 | -0.050 | 1.000 | 0.023 | -0.265 | 0.000 | 0.052 | 0.008 |
| review_total_positive_word_counts | -0.057 | 0.139 | -0.025 | 0.060 | 0.023 | 1.000 | 0.177 | 0.000 | -0.040 | 0.047 |
| reviewer_score | -0.027 | 0.200 | -0.015 | 0.035 | -0.265 | 0.177 | 1.000 | 1.000 | -0.043 | -0.014 |
| sample | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.003 |
| total_number_of_reviews | 0.859 | -0.194 | 0.151 | -0.044 | 0.052 | -0.040 | -0.043 | 0.000 | 1.000 | -0.039 |
| total_number_of_reviews_reviewer_has_given | -0.105 | 0.041 | -0.101 | 0.117 | 0.008 | 0.047 | -0.014 | 0.003 | -0.039 | 1.000 |
Missing values
Sample
| hotel_address | additional_number_of_scoring | review_date | average_score | hotel_name | reviewer_nationality | negative_review | review_total_negative_word_counts | total_number_of_reviews | positive_review | review_total_positive_word_counts | total_number_of_reviews_reviewer_has_given | tags | days_since_review | lat | lng | sample | reviewer_score | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Via Senigallia 6 20161 Milan Italy | 904 | 7/21/2017 | 8.1 | Hotel Da Vinci | United Kingdom | Would have appreciated a shop in the hotel that sold drinking water etc but not necessity Would recommend if like us you arrive late at night to bring drinks from plane airport as there s no shop nearby There is a minibar though if you want to pay those prices | 52 | 16670 | Hotel was great clean friendly staff free breakfast every morning with good selection good wifi connection nice sized room with bath fridge in room Personally loved the fact that the hotel isn t in the city centre but is literally next to a train station that you can easily get to and from the airport city Would definitely stay again | 62 | 1 | [' Leisure trip ', ' Couple ', ' Double Room ', ' Stayed 2 nights '] | 13 days | 45.533137 | 9.171102 | 0 | 0.0 |
| 1 | Arlandaweg 10 Westpoort 1043 EW Amsterdam Netherlands | 612 | 12/12/2016 | 8.6 | Urban Lodge Hotel | Belgium | No tissue paper box was present at the room | 10 | 5018 | No Positive | 0 | 7 | [' Leisure trip ', ' Group ', ' Triple Room ', ' Stayed 1 night '] | 234 day | 52.385649 | 4.834443 | 0 | 0.0 |
| 2 | Mallorca 251 Eixample 08008 Barcelona Spain | 46 | 11/26/2015 | 8.3 | Alexandra Barcelona A DoubleTree by Hilton | Sweden | Pillows | 3 | 351 | Nice welcoming and service | 5 | 15 | [' Business trip ', ' Solo traveler ', ' Twin Room ', ' Stayed 1 night ', ' Submitted from a mobile device '] | 616 day | 41.393192 | 2.161520 | 0 | 0.0 |
| 3 | Piazza Della Repubblica 17 Central Station 20124 Milan Italy | 241 | 10/17/2015 | 9.1 | Hotel Principe Di Savoia | United States of America | No Negative | 0 | 1543 | Everything including the nice upgrade The Hotel has been revamped and what a surprise Love every second of it including in room dining which was excellent | 27 | 9 | [' Leisure trip ', ' Couple ', ' Ambassador Junior Suite ', ' Stayed 1 night '] | 656 day | 45.479888 | 9.196298 | 0 | 0.0 |
| 4 | Singel 303 309 Amsterdam City Center 1012 WJ Amsterdam Netherlands | 834 | 5/16/2016 | 9.1 | Hotel Esther a | United Kingdom | No Negative | 0 | 4687 | Lovely hotel v welcoming staff | 7 | 2 | [' Business trip ', ' Solo traveler ', ' Classic Double or Twin Room ', ' Stayed 2 nights ', ' Submitted from a mobile device '] | 444 day | 52.370545 | 4.888644 | 0 | 0.0 |
| 5 | Coram Street Camden London WC1N 1HT United Kingdom | 709 | 8/13/2015 | 8.2 | Holiday Inn London Bloomsbury | Ecuador | They don t have free wifi | 7 | 2995 | The location is perfect if you don t have a lot of time and you want to have a look at the city centre | 26 | 3 | [' Business trip ', ' Solo traveler ', ' Standard Double or Twin Room ', ' Stayed 1 night '] | 721 day | 51.524125 | -0.125807 | 0 | 0.0 |
| 6 | Empire Way Wembley Brent London HA9 8DS United Kingdom | 1005 | 8/18/2016 | 8.3 | Holiday Inn London Wembley | United Kingdom | Room generally a bit shabby with some lack of maintenance Some crumbs on bedroom floor these issues did not spoil our minibreak It would be nice to have vegetarian sausages available for breakfast | 35 | 3469 | Location price It did not cost much more to have breakfast included Room was a reasonable size and bed was comfortable | 23 | 11 | [' Leisure trip ', ' Couple ', ' Queen Room ', ' Stayed 1 night '] | 350 day | 51.559095 | -0.284704 | 0 | 0.0 |
| 7 | 1 Shortlands Hammersmith and Fulham London W6 8DR United Kingdom | 704 | 8/11/2015 | 8.3 | Novotel London West | Netherlands | Executive rooms 9th Floor don t have a bath Their website made it look like all rooms did have one and when being at the end of a hall there s no wifi connection possible Mind that during my first two stays here I did have a perfect wifi connection | 52 | 2443 | Comphy bed upgraded to executive room with nespresso machine etc for only 24 3 nights quiet room clean 4 free waters in the fridge tho no refill and close to Hammersmith station shops and Starbucks Olympia is in walking distance too | 42 | 38 | [' Business trip ', ' Solo traveler ', ' Executive Room ', ' Stayed 3 nights ', ' Submitted from a mobile device '] | 723 day | 51.491959 | -0.220096 | 0 | 0.0 |
| 8 | 35 Rue Caumartin 9th arr 75009 Paris France | 211 | 6/25/2016 | 8.9 | Hotel Saint Petersbourg Opera | Ireland | Pity about the two days of rain | 8 | 2412 | Its centrality proximity to our destination | 7 | 1 | [' Group ', ' Double or Twin Room ', ' Stayed 1 night '] | 404 day | 48.872174 | 2.328075 | 0 | 0.0 |
| 9 | 49 Gloucester Place Marble Arch Westminster Borough London W1U 8JE United Kingdom | 61 | 9/30/2015 | 7.4 | St George Hotel | Canada | Didn t like it at all construction was in progress stuff lied to us about vacancy | 18 | 334 | Didn t like anything about the stay if i had a chance to change or cancel it I would do it right away | 25 | 1 | [' Couple ', ' Standard Triple Room ', ' Stayed 2 nights ', ' Submitted from a mobile device '] | 673 day | 51.518277 | -0.158351 | 0 | 0.0 |
| hotel_address | additional_number_of_scoring | review_date | average_score | hotel_name | reviewer_nationality | negative_review | review_total_negative_word_counts | total_number_of_reviews | positive_review | review_total_positive_word_counts | total_number_of_reviews_reviewer_has_given | tags | days_since_review | lat | lng | sample | reviewer_score | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 515728 | 3 rue de Ponthieu 8th arr 75008 Paris France | 70 | 12/1/2016 | 8.7 | H tel Mathis Elys es | United States of America | No Negative | 0 | 652 | Location | 2 | 5 | [' Leisure trip ', ' Group ', ' Junior Suite ', ' Stayed 2 nights ', ' Submitted from a mobile device '] | 245 day | 48.870033 | 2.311274 | 1 | 9.6 |
| 515729 | 15 Rue Boissy d Anglas 8th arr 75008 Paris France | 91 | 12/21/2016 | 8.5 | Sofitel Paris Le Faubourg | United Arab Emirates | No Negative | 0 | 564 | Location was perfect Room was very comfortable spacious | 10 | 40 | [' Leisure trip ', ' Solo traveler ', ' Luxury Room 1 Queensize Bed Twin bedded Room On Request ', ' Stayed 6 nights ', ' Submitted from a mobile device '] | 225 day | 48.868414 | 2.321325 | 1 | 9.2 |
| 515730 | 52 56 Inverness Terrace Westminster Borough London W2 3LB United Kingdom | 545 | 10/12/2015 | 8.0 | Shaftesbury Hyde Park International | United Kingdom | staff miserable and room very small | 7 | 2907 | Outstanding location | 3 | 1 | [' Leisure trip ', ' Couple ', ' Deluxe Double Room ', ' Stayed 1 night '] | 661 day | 51.512397 | -0.186124 | 1 | 5.8 |
| 515731 | 22 Portman Square Westminster Borough London W1H 7BG United Kingdom | 597 | 8/16/2016 | 7.9 | Radisson Blu Portman Hotel London | United Kingdom | Room was very small so much so that I kept hitting myself on the TV that was mounted on the wall Bed was very soft and pillows were awful | 30 | 2308 | Staff were friendly and efficient | 6 | 3 | [' Leisure trip ', ' Couple ', ' Standard Double Room ', ' Stayed 1 night ', ' Submitted from a mobile device '] | 352 day | 51.516191 | -0.157949 | 1 | 5.0 |
| 515732 | 24 Ludgate Hill City of London London EC4M 7DR United Kingdom | 918 | 11/8/2015 | 8.4 | Club Quarters Hotel St Paul s | Sweden | Our room was really cold and we had problem with the heater They had to bring a portable heater to fix the issue | 25 | 4117 | It is a nice and clean hotel with a good location | 13 | 3 | [' Leisure trip ', ' Group ', ' Standard Queen Room ', ' Stayed 3 nights '] | 634 day | 51.513930 | -0.101126 | 1 | 7.9 |
| 515733 | 9 Knaresborough Place Kensington and Chelsea London SW5 0TP United Kingdom | 107 | 4/19/2017 | 9.0 | Hotel Moonlight | France | No Negative | 0 | 617 | Tr s proche du metro Earl s court | 10 | 10 | [' Leisure trip ', ' Group ', ' Club Double or Twin Room ', ' Stayed 2 nights ', ' Submitted from a mobile device '] | 106 day | 51.494028 | -0.191050 | 1 | 8.8 |
| 515734 | Landstra er Hauptstra e 155 03 Landstra e 1030 Vienna Austria | 272 | 2/13/2017 | 8.4 | BEST WESTERN PLUS Amedia Wien | Turkey | No Negative | 0 | 3224 | The bed was so comfy I stayed with my boyfriend we had a double bed Also transportation is excellent the hotel is very very close to Old City Once you exit the hotel just turn right about 50m away there is a bus stop get off on Stubentor it is the last stop It only takes 10min Also you can take the same bus back to the hotel The bus name is 74A St Marx The hotel was very clean and the room that we accomidated in was nice and roomy | 93 | 1 | [' Leisure trip ', ' Couple ', ' Standard Double Room ', ' Stayed 4 nights ', ' Submitted from a mobile device '] | 171 day | 48.192379 | 16.399451 | 1 | 9.2 |
| 515735 | 29 31 Gower Street Camden London WC1E 6HG United Kingdom | 457 | 2/7/2016 | 6.8 | Bloomsbury Palace Hotel | Netherlands | room is really small but guess is normal in London | 12 | 2751 | great location simple check in out nice shower | 9 | 21 | [' Business trip ', ' Solo traveler ', ' Single Room ', ' Stayed 1 night '] | 543 day | 51.520795 | -0.131084 | 1 | 8.3 |
| 515736 | 31 Great Cumberland Place Westminster Borough London W1H 7TA United Kingdom | 365 | 5/21/2017 | 8.1 | The Marble Arch London | United Arab Emirates | No Negative | 0 | 1567 | Location and very comfy bed | 6 | 28 | [' Leisure trip ', ' Solo traveler ', ' Deluxe Double Room ', ' Stayed 2 nights '] | 74 days | 51.515125 | -0.160066 | 1 | 9.2 |
| 515737 | 25 Courtfield Gardens Kensington and Chelsea London SW5 0PG United Kingdom | 222 | 8/5/2016 | 9.0 | The Nadler Kensington | Australia | Patio outside could have been cleaned of algae to give a more uplifting atmosphere to a downstairs room | 20 | 1209 | Beds comfortable Pillows also good Homely feel although room was small Staff very pleasant and helpful thank you | 20 | 2 | [' Leisure trip ', ' Couple ', ' Bunk Bed Room ', ' Stayed 4 nights '] | 363 day | 51.493109 | -0.190208 | 1 | 8.8 |
Duplicate rows
Most frequently occurring
| hotel_address | additional_number_of_scoring | review_date | average_score | hotel_name | reviewer_nationality | negative_review | review_total_negative_word_counts | total_number_of_reviews | positive_review | review_total_positive_word_counts | total_number_of_reviews_reviewer_has_given | tags | days_since_review | lat | lng | sample | reviewer_score | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 16 22 Great Russell Street Camden London WC1B 3NN United Kingdom | 300 | 7/27/2017 | 9.0 | The Bloomsbury Hotel | Israel | No Negative | 0 | 1254 | The attention received by Sebastian and his team was exceptional | 12 | 4 | [' Leisure trip ', ' Couple ', ' Superior Double Room ', ' Stayed 2 nights ', ' Submitted from a mobile device '] | 7 days | 51.517167 | -0.129053 | 1 | 9.6 | 2 |
| 1 | 167 rue de Rome 17th arr 75017 Paris France | 11 | 10/14/2016 | 6.8 | Villa Eugenie | Iran | Evry thing was wrong Cold room Dark room No refrigetor in room No ac Evry thing was baaaaaaad Very bad | 21 | 165 | This hotel was terrible this place worst place in paris Hostel is better than this place I m sorry for myself to spend my time in this place | 29 | 2 | [' Solo traveler ', ' Single Room ', ' Stayed 2 nights ', ' Submitted from a mobile device '] | 293 day | 48.887128 | 2.314205 | 1 | 2.5 | 2 |
| 2 | 167 rue de Rome 17th arr 75017 Paris France | 11 | 10/14/2016 | 6.8 | Villa Eugenie | United Kingdom | Evry thing of this place i can t name horlrel to this place was wrong and out of repair all lamp were nt work 2nights i got cold in cold room Ac not working i so sorry for my self that i had to spend my time in this ruin place 4star hotel has nt refrigerator | 58 | 165 | This hotel is worst hotel Its terrible | 9 | 1 | [' Solo traveler ', ' Single Room ', ' Stayed 2 nights ', ' Submitted from a mobile device '] | 293 day | 48.887128 | 2.314205 | 1 | 2.5 | 2 |
| 3 | 167 rue de Rome 17th arr 75017 Paris France | 11 | 10/18/2016 | 6.8 | Villa Eugenie | Israel | No Negative | 0 | 165 | the room was very french and beautiful it was good location and I enjoyed to stay there | 18 | 1 | [' Leisure trip ', ' Couple ', ' Twin Room ', ' Stayed 3 nights '] | 289 day | 48.887128 | 2.314205 | 1 | 9.2 | 2 |
| 4 | 167 rue de Rome 17th arr 75017 Paris France | 11 | 10/2/2016 | 6.8 | Villa Eugenie | Qatar | Staff very rude My credit card was charged Before my stay and while Checking out they charged me again when I told the receptionist about it her answer was I m not trying to steal your money madam in a ver un polite way Blaming my Bank about it Very poor selection for breakfast | 56 | 165 | Nothing | 3 | 8 | [' Leisure trip ', ' Family with young children ', ' Two Connecting Double Rooms ', ' Stayed 2 nights ', ' Submitted from a mobile device '] | 305 day | 48.887128 | 2.314205 | 1 | 5.0 | 2 |
| 5 | 167 rue de Rome 17th arr 75017 Paris France | 11 | 12/12/2016 | 6.8 | Villa Eugenie | Canada | Listed above | 3 | 165 | It was a terrible stat unfriendly staff very unprofessional and dirty rooms | 13 | 1 | [' Business trip ', ' Solo traveler ', ' Standard Double or Twin Room ', ' Stayed 6 nights ', ' Submitted from a mobile device '] | 234 day | 48.887128 | 2.314205 | 0 | 0.0 | 2 |
| 6 | 167 rue de Rome 17th arr 75017 Paris France | 11 | 5/26/2016 | 6.8 | Villa Eugenie | France | very dated d cor certainly NOT a 4 star hotel | 11 | 165 | bed was confy central friendliy staff | 7 | 1 | [' Business trip ', ' Solo traveler ', ' Standard Double or Twin Room ', ' Stayed 2 nights '] | 434 day | 48.887128 | 2.314205 | 1 | 6.7 | 2 |
| 7 | 167 rue de Rome 17th arr 75017 Paris France | 11 | 8/2/2017 | 6.8 | Villa Eugenie | United States of America | Place is old not worthy of 4 stars | 9 | 165 | Location friendly staff cell phone for use during stay in Paris | 12 | 21 | [' Leisure trip ', ' Solo traveler ', ' Standard Double or Twin Room ', ' Stayed 1 night ', ' Submitted from a mobile device '] | 1 days | 48.887128 | 2.314205 | 1 | 7.1 | 2 |
| 8 | 167 rue de Rome 17th arr 75017 Paris France | 11 | 9/19/2015 | 6.8 | Villa Eugenie | Switzerland | Facilities did not function Sink was blocked Something heavy fell If I had been in the room I would have been injured Way overpriced for what it is | 30 | 165 | The staff tried | 5 | 4 | [' Business trip ', ' Solo traveler ', ' Standard Double or Twin Room ', ' Stayed 1 night '] | 684 day | 48.887128 | 2.314205 | 1 | 4.2 | 2 |
| 9 | 167 rue de Rome 17th arr 75017 Paris France | 11 | 9/22/2016 | 6.8 | Villa Eugenie | Italy | Bed bugs air condition not work | 7 | 165 | Front office is helpfull | 5 | 1 | [' Leisure trip ', ' Family with young children ', ' Standard Double or Twin Room ', ' Stayed 1 night ', ' Submitted from a mobile device '] | 315 day | 48.887128 | 2.314205 | 1 | 5.0 | 2 |